Appendix A Proofs for Section 2
We construct a "ghost" point x… By Section 4.5 of [4], and from Lemma 3.1 and Proposition 3.2 in [48], we have … The last relationship we want to show is just equation (13). We separate the discussion into deterministic and stochastic settings. The total complexity is then KT. By Corollary 3.2 and the discussion in Section 3.2, Algorithm 1 combined with EG/OGDA can solve such auxiliary problems. We implement these algorithms in the same way as in Section 5. We compare EG and Catalyst-EG under the same stepsizes in Figure 4(a), which plots the distance to the limit point.
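For context on the updates being compared above, here is a minimal sketch of a single extragradient (EG) step for a smooth saddle-point problem min_x max_y g(x, y); the quadratic test problem, the stepsize, and the gradient oracles are illustrative assumptions, not the paper's exact Algorithm 1 or its Catalyst wrapper.

```python
def eg_step(x, y, grad_x, grad_y, eta):
    """One extragradient step: extrapolate to a midpoint, then update
    using the gradients evaluated at that midpoint."""
    x_mid = x - eta * grad_x(x, y)   # descent extrapolation for the min player
    y_mid = y + eta * grad_y(x, y)   # ascent extrapolation for the max player
    x_new = x - eta * grad_x(x_mid, y_mid)
    y_new = y + eta * grad_y(x_mid, y_mid)
    return x_new, y_new

# Illustrative saddle problem g(x, y) = 0.05*x**2 + x*y - 0.05*y**2,
# whose unique saddle point is (0, 0).
gx = lambda x, y: 0.1 * x + y   # dg/dx
gy = lambda x, y: x - 0.1 * y   # dg/dy
x, y = 1.0, 1.0
for _ in range(200):
    x, y = eg_step(x, y, gx, gy, eta=0.2)
print(x, y)  # both coordinates approach 0
```

Plain simultaneous gradient descent-ascent can cycle on problems with a strong bilinear term; the midpoint evaluation is what lets EG converge here.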
Supplementary Material for: Improved Algorithms for Convex-Concave Minimax Optimization
1 Some Useful Properties
In this section, we review some useful properties of functions in the class F(m… Then, we have that 1. y… Fact 2. Let z := [x; y] and z… This can be easily proven using the AM-GM inequality. Fact 3. Let z := [x; y] ∈ R… It is a crucial building block for the algorithms in this work. The following classical theorem holds for AGD. We will start by giving a precise statement of Algorithm 1. Algorithm 1 (Alternating Best Response, ABR); Require: g(·, ·), initial point z… The basic idea is the following. The following two lemmas about the inexact APPA algorithm follow from the proof of Theorem 4.1 of […]; here we provide their proofs for completeness.
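Since the snippet names Algorithm 1 (Alternating Best Response) without its body, here is a hedged sketch of the generic alternating-best-response pattern for a two-player objective g(x, y); the inner gradient loops and the quadratic example are stand-ins chosen for illustration, not the paper's actual subroutines or guarantees.

```python
def abr(g_grad_x, g_grad_y, x0, y0, rounds=50, inner=100, eta=0.05):
    """Alternating best response (sketch): each player in turn
    approximately best-responds to the other's current strategy."""
    x, y = x0, y0
    for _ in range(rounds):
        # x-player: approximately minimize g(., y) by gradient descent.
        for _ in range(inner):
            x = x - eta * g_grad_x(x, y)
        # y-player: approximately maximize g(x, .) by gradient ascent.
        for _ in range(inner):
            y = y + eta * g_grad_y(x, y)
    return x, y

# Example: g(x, y) = 0.5*x**2 + 0.2*x*y - 0.5*y**2, strongly
# convex in x and strongly concave in y, with weak coupling.
gx = lambda x, y: x + 0.2 * y
gy = lambda x, y: 0.2 * x - y
x_star, y_star = abr(gx, gy, 1.0, -1.0)
print(x_star, y_star)  # approaches the saddle point (0, 0)
```

With weak coupling relative to the strong convexity/concavity, each exact best response is a contraction (here x ← -0.2y, y ← 0.2x), which is why this simple alternation converges.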
Fully Zeroth-Order Bilevel Programming via Gaussian Smoothing
Alireza Aghasi, Saeed Ghadimi
We are particularly interested in the setting where neither explicit knowledge of f and g nor their unbiased stochastic derivatives is available. In this zeroth-order setting, we assume that only noisy evaluations of f and g are available upon query to an oracle. The BLP problem was first introduced by Bracken and McGill in the 1970s [7], followed by a more general form of the problem involving joint constraints on the outer and inner variables. This is a fundamental problem in engineering and economics with direct applications in problems such as decision making [48], supply chain [61, 59], network design [51, 43], transportation and planning [16, 83], and optimal design [4, 32]. More recently, BLP has found applications in many areas of machine learning and artificial intelligence. Zeroth-order methods apply to many optimization problems (including the BLP) where, for reasons such as complexity, lack of access to an accurate model, or computational limitations, there is no or only limited access to the objective gradient.
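As a concrete illustration of the zeroth-order oracle model the abstract describes, the sketch below implements the standard Gaussian-smoothing gradient estimator, which approximates a gradient from noisy function evaluations only; the smoothing radius mu, the sample budget, and the noisy quadratic are illustrative assumptions rather than the estimator analyzed in the paper.

```python
import numpy as np

def zo_gradient(f, x, mu=1e-2, num_samples=64, rng=None):
    """Gaussian-smoothing zeroth-order gradient estimate:
    average of (f(x + mu*u) - f(x)) / mu * u over u ~ N(0, I),
    an unbiased estimate of the gradient of the smoothed function
    f_mu(x) = E_u[f(x + mu*u)]."""
    rng = np.random.default_rng() if rng is None else rng
    d = x.shape[0]
    fx = f(x)
    g = np.zeros(d)
    for _ in range(num_samples):
        u = rng.standard_normal(d)
        g += (f(x + mu * u) - fx) / mu * u
    return g / num_samples

# Example: noisy oracle for f(x) = ||x||^2, whose true gradient is 2*x.
rng = np.random.default_rng(0)
f = lambda x: float(x @ x) + 1e-3 * rng.standard_normal()
x = np.ones(5)
print(zo_gradient(f, x, rng=rng))  # a noisy estimate of [2, 2, 2, 2, 2]
```

The same finite-difference-along-random-directions idea extends to the bilevel setting by querying the inner and outer objectives separately; the paper's contribution concerns how to do this for both levels jointly.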
On the Convex Behavior of Deep Neural Networks in Relation to the Layers' Width
The Hessian of a neural network's loss can be decomposed into a sum of two matrices: (i) the positive semidefinite generalized Gauss-Newton matrix G, and (ii) the matrix H, which contains the negative eigenvalues. We observe that for wider networks, gradient descent moves through regions of positive curvature at the start and end of training, and through regions of near-zero curvature in between. In other words, during the crucial parts of training, the Hessian of a wide network appears to be dominated by the component G. To explain this phenomenon, we show that when initialized using common methodologies, the gradients of over-parameterized networks are approximately orthogonal to H, so that the curvature of the loss surface is strictly positive in the direction of the gradient.
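To spell out the decomposition the abstract refers to, the identities below give the standard Gauss-Newton splitting of the loss Hessian and the resulting curvature along the gradient direction; the notation (per-example Jacobians J_i, loss ℓ applied to network outputs f) is an assumption for exposition, not taken from the paper.

```latex
% Gauss-Newton splitting of the Hessian of L(\theta) = \sum_i \ell(f(x_i;\theta)):
\nabla^2 L(\theta)
  \;=\; \underbrace{\sum_i J_i^\top \big(\nabla^2_f \ell\big)\, J_i}_{G \,\succeq\, 0}
  \;+\; \underbrace{\sum_i \sum_k \frac{\partial \ell}{\partial f_k}\,
        \nabla^2_\theta f_k(x_i;\theta)}_{H},
  \qquad J_i := \frac{\partial f(x_i;\theta)}{\partial \theta}.

% If the gradient g = \nabla L(\theta) is approximately orthogonal to H
% (i.e., H g \approx 0), the curvature along the gradient is nonnegative:
g^\top \nabla^2 L\, g \;=\; g^\top G\, g \;+\; g^\top H\, g
  \;\approx\; g^\top G\, g \;\ge\; 0.
```

Under the paper's claim that gradients at common initializations are approximately orthogonal to H, the second identity is exactly why the loss looks locally convex along the optimization trajectory of a wide network.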